ActionFlowNet: Learning Motion Representation for Action Recognition

نویسندگان

  • Joe Yue-Hei Ng
  • Jonghyun Choi
  • Jan Neumann
  • Larry S. Davis
چکیده

Even with the recent advances in convolutional neural networks (CNN) in various visual recognition tasks, the state-of-the-art action recognition system still relies on hand crafted motion feature such as optical flow to achieve the best performance. We propose a multitask learning model ActionFlowNet to train a single stream network directly from raw pixels to jointly estimate optical flow while recognizing actions with convolutional neural networks, capturing both appearance and motion in a single model. We additionally provide insights to how the quality of the learned optical flow affects the action recognition. Our model not only significantly improves action recognition accuracy by a large margin (17%) compared to state-of-theart CNN-based action recognition models trained without external large scale data and additional optical flow input, but also produces the optical flow as a side product.

منابع مشابه

Action Change Detection in Video Based on HOG

Background and Objectives: Action recognition, as the processes of labeling an unknown action of a query video, is a challenging problem, due to the event complexity, variations in imaging conditions, and intra- and inter-individual action-variability. A number of solutions proposed to solve action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...

متن کامل

Action Chart: A Representation for Efficient Recognition of Complex Activities

In this paper we propose an efficient method for the recognition of long and complex action streams. First, we design a new motion feature flow descriptor by composing low-level local features. Then a new data embedding method is developed in order to represent the motion flow as an one-dimensional sequence, whilst preserving useful motion information for recognition. Finally attentional motion...

متن کامل

3D Hand Motion Evaluation Using HMM

Gesture and motion recognition are needed for a variety of applications. The use of human hand motions as a natural interface tool has motivated researchers to conduct research in the modeling, analysis and recognition of various hand movements. In particular, human-computer intelligent interaction has been a focus of research in vision-based gesture recognition. In this work, we introduce a 3-...

متن کامل

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Robot Motion and Task Learning with Error Recovery

The ability to learn is essential for robots to function and perform services within adynamic human environment. Robot programming by demonstration facilitates learningthrough a human teacher without the need to develop new code for each task that therobot performs. In order for learning to be generalizable, the robot needs to be ableto grasp the underlying structure of the task...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:
  • CoRR

دوره abs/1612.03052  شماره 

صفحات  -

تاریخ انتشار 2016